Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Annotated Chemical Patent Corpus: A Gold Standard for Text Mining

Identifieur interne : 000110 ( Main/Exploration ); précédent : 000109; suivant : 000111

Annotated Chemical Patent Corpus: A Gold Standard for Text Mining

Auteurs : Saber A. Akhondi [Pays-Bas] ; Alexander G. Klenner [Allemagne] ; Christian Tyrchan [Suède] ; Anil K. Manchala [Inde] ; Kiran Boppana [Inde] ; Daniel Lowe [Royaume-Uni] ; Marc Zimmermann [Allemagne] ; Sarma A. R. P. Jagarlapudi [Inde] ; Roger Sayle [Royaume-Uni] ; Jan A. Kors [Pays-Bas] ; Sorel Muresan [Suède]

Source :

RBID : PMC:4182036

Abstract

Exploring the chemical and biological space covered by patent applications is crucial in early-stage medicinal chemistry activities. Patent analysis can provide understanding of compound prior art, novelty checking, validation of biological assays, and identification of new starting points for chemical exploration. Extracting chemical and biological entities from patents through manual extraction by expert curators can take substantial amount of time and resources. Text mining methods can help to ease this process. To validate the performance of such methods, a manually annotated patent corpus is essential. In this study we have produced a large gold standard chemical patent corpus. We developed annotation guidelines and selected 200 full patents from the World Intellectual Property Organization, United States Patent and Trademark Office, and European Patent Office. The patents were pre-annotated automatically and made available to four independent annotator groups each consisting of two to ten annotators. The annotators marked chemicals in different subclasses, diseases, targets, and modes of action. Spelling mistakes and spurious line break due to optical character recognition errors were also annotated. A subset of 47 patents was annotated by at least three annotator groups, from which harmonized annotations and inter-annotator agreement scores were derived. One group annotated the full set. The patent corpus includes 400,125 annotations for the full set and 36,537 annotations for the harmonized set. All patents and annotated entities are publicly available at www.biosemantics.org.


Url:
DOI: 10.1371/journal.pone.0107477
PubMed: 25268232
PubMed Central: 4182036


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Annotated Chemical Patent Corpus: A Gold Standard for Text Mining</title>
<author>
<name sortKey="Akhondi, Saber A" sort="Akhondi, Saber A" uniqKey="Akhondi S" first="Saber A." last="Akhondi">Saber A. Akhondi</name>
<affiliation wicri:level="3">
<nlm:aff id="aff1">
<addr-line>Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam, The Netherlands</addr-line>
</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam</wicri:regionArea>
<placeName>
<settlement type="city">Rotterdam</settlement>
<region nuts="2" type="province">Hollande-Méridionale</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Klenner, Alexander G" sort="Klenner, Alexander G" uniqKey="Klenner A" first="Alexander G." last="Klenner">Alexander G. Klenner</name>
<affiliation wicri:level="3">
<nlm:aff id="aff2">
<addr-line>Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI), Fraunhofer-Gesellschaft, Sankt Augustin, Germany</addr-line>
</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI), Fraunhofer-Gesellschaft, Sankt Augustin</wicri:regionArea>
<placeName>
<region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Sankt Augustin</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Tyrchan, Christian" sort="Tyrchan, Christian" uniqKey="Tyrchan C" first="Christian" last="Tyrchan">Christian Tyrchan</name>
<affiliation wicri:level="1">
<nlm:aff id="aff3">
<addr-line>RIA Medicinal Chemistry, AstraZeneca R&D Mölndal, Mölndal, Sweden</addr-line>
</nlm:aff>
<country xml:lang="fr">Suède</country>
<wicri:regionArea>RIA Medicinal Chemistry, AstraZeneca R&D Mölndal, Mölndal</wicri:regionArea>
<wicri:noRegion>Mölndal</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Manchala, Anil K" sort="Manchala, Anil K" uniqKey="Manchala A" first="Anil K." last="Manchala">Anil K. Manchala</name>
<affiliation wicri:level="1">
<nlm:aff id="aff4">
<addr-line>GVK Biosciences Private Limited, Hyderabad, India</addr-line>
</nlm:aff>
<country xml:lang="fr">Inde</country>
<wicri:regionArea>GVK Biosciences Private Limited, Hyderabad</wicri:regionArea>
<wicri:noRegion>Hyderabad</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Boppana, Kiran" sort="Boppana, Kiran" uniqKey="Boppana K" first="Kiran" last="Boppana">Kiran Boppana</name>
<affiliation wicri:level="1">
<nlm:aff id="aff4">
<addr-line>GVK Biosciences Private Limited, Hyderabad, India</addr-line>
</nlm:aff>
<country xml:lang="fr">Inde</country>
<wicri:regionArea>GVK Biosciences Private Limited, Hyderabad</wicri:regionArea>
<wicri:noRegion>Hyderabad</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Lowe, Daniel" sort="Lowe, Daniel" uniqKey="Lowe D" first="Daniel" last="Lowe">Daniel Lowe</name>
<affiliation wicri:level="2">
<nlm:aff id="aff5">
<addr-line>NextMove Software Ltd, Cambridge, England</addr-line>
</nlm:aff>
<country>Royaume-Uni</country>
<placeName>
<region type="country">Angleterre</region>
</placeName>
<wicri:cityArea>NextMove Software Ltd, Cambridge</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Zimmermann, Marc" sort="Zimmermann, Marc" uniqKey="Zimmermann M" first="Marc" last="Zimmermann">Marc Zimmermann</name>
<affiliation wicri:level="3">
<nlm:aff id="aff6">
<addr-line>Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI), Fraunhofer-Gesellschaft, Sankt Augustin, Germany</addr-line>
</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI), Fraunhofer-Gesellschaft, Sankt Augustin</wicri:regionArea>
<placeName>
<region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Sankt Augustin</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Jagarlapudi, Sarma A R P" sort="Jagarlapudi, Sarma A R P" uniqKey="Jagarlapudi S" first="Sarma A. R. P." last="Jagarlapudi">Sarma A. R. P. Jagarlapudi</name>
<affiliation wicri:level="1">
<nlm:aff id="aff4">
<addr-line>GVK Biosciences Private Limited, Hyderabad, India</addr-line>
</nlm:aff>
<country xml:lang="fr">Inde</country>
<wicri:regionArea>GVK Biosciences Private Limited, Hyderabad</wicri:regionArea>
<wicri:noRegion>Hyderabad</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Sayle, Roger" sort="Sayle, Roger" uniqKey="Sayle R" first="Roger" last="Sayle">Roger Sayle</name>
<affiliation wicri:level="2">
<nlm:aff id="aff5">
<addr-line>NextMove Software Ltd, Cambridge, England</addr-line>
</nlm:aff>
<country>Royaume-Uni</country>
<placeName>
<region type="country">Angleterre</region>
</placeName>
<wicri:cityArea>NextMove Software Ltd, Cambridge</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Kors, Jan A" sort="Kors, Jan A" uniqKey="Kors J" first="Jan A." last="Kors">Jan A. Kors</name>
<affiliation wicri:level="3">
<nlm:aff id="aff1">
<addr-line>Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam, The Netherlands</addr-line>
</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam</wicri:regionArea>
<placeName>
<settlement type="city">Rotterdam</settlement>
<region nuts="2" type="province">Hollande-Méridionale</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Muresan, Sorel" sort="Muresan, Sorel" uniqKey="Muresan S" first="Sorel" last="Muresan">Sorel Muresan</name>
<affiliation wicri:level="1">
<nlm:aff id="aff7">
<addr-line>Chemistry Innovation Centre, AstraZeneca R&D Mölndal, Mölndal, Sweden</addr-line>
</nlm:aff>
<country xml:lang="fr">Suède</country>
<wicri:regionArea>Chemistry Innovation Centre, AstraZeneca R&D Mölndal, Mölndal</wicri:regionArea>
<wicri:noRegion>Mölndal</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">25268232</idno>
<idno type="pmc">4182036</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4182036</idno>
<idno type="RBID">PMC:4182036</idno>
<idno type="doi">10.1371/journal.pone.0107477</idno>
<date when="2014">2014</date>
<idno type="wicri:Area/Pmc/Corpus">000014</idno>
<idno type="wicri:Area/Pmc/Curation">000014</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000057</idno>
<idno type="wicri:Area/Ncbi/Merge">000211</idno>
<idno type="wicri:Area/Ncbi/Curation">000211</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000211</idno>
<idno type="wicri:Area/Main/Merge">000111</idno>
<idno type="wicri:Area/Main/Curation">000110</idno>
<idno type="wicri:Area/Main/Exploration">000110</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Annotated Chemical Patent Corpus: A Gold Standard for Text Mining</title>
<author>
<name sortKey="Akhondi, Saber A" sort="Akhondi, Saber A" uniqKey="Akhondi S" first="Saber A." last="Akhondi">Saber A. Akhondi</name>
<affiliation wicri:level="3">
<nlm:aff id="aff1">
<addr-line>Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam, The Netherlands</addr-line>
</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam</wicri:regionArea>
<placeName>
<settlement type="city">Rotterdam</settlement>
<region nuts="2" type="province">Hollande-Méridionale</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Klenner, Alexander G" sort="Klenner, Alexander G" uniqKey="Klenner A" first="Alexander G." last="Klenner">Alexander G. Klenner</name>
<affiliation wicri:level="3">
<nlm:aff id="aff2">
<addr-line>Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI), Fraunhofer-Gesellschaft, Sankt Augustin, Germany</addr-line>
</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI), Fraunhofer-Gesellschaft, Sankt Augustin</wicri:regionArea>
<placeName>
<region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Sankt Augustin</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Tyrchan, Christian" sort="Tyrchan, Christian" uniqKey="Tyrchan C" first="Christian" last="Tyrchan">Christian Tyrchan</name>
<affiliation wicri:level="1">
<nlm:aff id="aff3">
<addr-line>RIA Medicinal Chemistry, AstraZeneca R&D Mölndal, Mölndal, Sweden</addr-line>
</nlm:aff>
<country xml:lang="fr">Suède</country>
<wicri:regionArea>RIA Medicinal Chemistry, AstraZeneca R&D Mölndal, Mölndal</wicri:regionArea>
<wicri:noRegion>Mölndal</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Manchala, Anil K" sort="Manchala, Anil K" uniqKey="Manchala A" first="Anil K." last="Manchala">Anil K. Manchala</name>
<affiliation wicri:level="1">
<nlm:aff id="aff4">
<addr-line>GVK Biosciences Private Limited, Hyderabad, India</addr-line>
</nlm:aff>
<country xml:lang="fr">Inde</country>
<wicri:regionArea>GVK Biosciences Private Limited, Hyderabad</wicri:regionArea>
<wicri:noRegion>Hyderabad</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Boppana, Kiran" sort="Boppana, Kiran" uniqKey="Boppana K" first="Kiran" last="Boppana">Kiran Boppana</name>
<affiliation wicri:level="1">
<nlm:aff id="aff4">
<addr-line>GVK Biosciences Private Limited, Hyderabad, India</addr-line>
</nlm:aff>
<country xml:lang="fr">Inde</country>
<wicri:regionArea>GVK Biosciences Private Limited, Hyderabad</wicri:regionArea>
<wicri:noRegion>Hyderabad</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Lowe, Daniel" sort="Lowe, Daniel" uniqKey="Lowe D" first="Daniel" last="Lowe">Daniel Lowe</name>
<affiliation wicri:level="2">
<nlm:aff id="aff5">
<addr-line>NextMove Software Ltd, Cambridge, England</addr-line>
</nlm:aff>
<country>Royaume-Uni</country>
<placeName>
<region type="country">Angleterre</region>
</placeName>
<wicri:cityArea>NextMove Software Ltd, Cambridge</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Zimmermann, Marc" sort="Zimmermann, Marc" uniqKey="Zimmermann M" first="Marc" last="Zimmermann">Marc Zimmermann</name>
<affiliation wicri:level="3">
<nlm:aff id="aff6">
<addr-line>Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI), Fraunhofer-Gesellschaft, Sankt Augustin, Germany</addr-line>
</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Fraunhofer-Institute for Algorithms and Scientific Computing (SCAI), Fraunhofer-Gesellschaft, Sankt Augustin</wicri:regionArea>
<placeName>
<region type="land" nuts="1">Rhénanie-du-Nord-Westphalie</region>
<region type="district" nuts="2">District de Cologne</region>
<settlement type="city">Sankt Augustin</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Jagarlapudi, Sarma A R P" sort="Jagarlapudi, Sarma A R P" uniqKey="Jagarlapudi S" first="Sarma A. R. P." last="Jagarlapudi">Sarma A. R. P. Jagarlapudi</name>
<affiliation wicri:level="1">
<nlm:aff id="aff4">
<addr-line>GVK Biosciences Private Limited, Hyderabad, India</addr-line>
</nlm:aff>
<country xml:lang="fr">Inde</country>
<wicri:regionArea>GVK Biosciences Private Limited, Hyderabad</wicri:regionArea>
<wicri:noRegion>Hyderabad</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Sayle, Roger" sort="Sayle, Roger" uniqKey="Sayle R" first="Roger" last="Sayle">Roger Sayle</name>
<affiliation wicri:level="2">
<nlm:aff id="aff5">
<addr-line>NextMove Software Ltd, Cambridge, England</addr-line>
</nlm:aff>
<country>Royaume-Uni</country>
<placeName>
<region type="country">Angleterre</region>
</placeName>
<wicri:cityArea>NextMove Software Ltd, Cambridge</wicri:cityArea>
</affiliation>
</author>
<author>
<name sortKey="Kors, Jan A" sort="Kors, Jan A" uniqKey="Kors J" first="Jan A." last="Kors">Jan A. Kors</name>
<affiliation wicri:level="3">
<nlm:aff id="aff1">
<addr-line>Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam, The Netherlands</addr-line>
</nlm:aff>
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam</wicri:regionArea>
<placeName>
<settlement type="city">Rotterdam</settlement>
<region nuts="2" type="province">Hollande-Méridionale</region>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Muresan, Sorel" sort="Muresan, Sorel" uniqKey="Muresan S" first="Sorel" last="Muresan">Sorel Muresan</name>
<affiliation wicri:level="1">
<nlm:aff id="aff7">
<addr-line>Chemistry Innovation Centre, AstraZeneca R&D Mölndal, Mölndal, Sweden</addr-line>
</nlm:aff>
<country xml:lang="fr">Suède</country>
<wicri:regionArea>Chemistry Innovation Centre, AstraZeneca R&D Mölndal, Mölndal</wicri:regionArea>
<wicri:noRegion>Mölndal</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j">PLoS ONE</title>
<idno type="eISSN">1932-6203</idno>
<imprint>
<date when="2014">2014</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>Exploring the chemical and biological space covered by patent applications is crucial in early-stage medicinal chemistry activities. Patent analysis can provide understanding of compound prior art, novelty checking, validation of biological assays, and identification of new starting points for chemical exploration. Extracting chemical and biological entities from patents through manual extraction by expert curators can take substantial amount of time and resources. Text mining methods can help to ease this process. To validate the performance of such methods, a manually annotated patent corpus is essential. In this study we have produced a large gold standard chemical patent corpus. We developed annotation guidelines and selected 200 full patents from the World Intellectual Property Organization, United States Patent and Trademark Office, and European Patent Office. The patents were pre-annotated automatically and made available to four independent annotator groups each consisting of two to ten annotators. The annotators marked chemicals in different subclasses, diseases, targets, and modes of action. Spelling mistakes and spurious line break due to optical character recognition errors were also annotated. A subset of 47 patents was annotated by at least three annotator groups, from which harmonized annotations and inter-annotator agreement scores were derived. One group annotated the full set. The patent corpus includes 400,125 annotations for the full set and 36,537 annotations for the harmonized set. All patents and annotated entities are publicly available at
<ext-link ext-link-type="uri" xlink:href="http://www.biosemantics.org">www.biosemantics.org</ext-link>
.</p>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Muresan, S" uniqKey="Muresan S">S Muresan</name>
</author>
<author>
<name sortKey="Petrov, P" uniqKey="Petrov P">P Petrov</name>
</author>
<author>
<name sortKey="Southan, C" uniqKey="Southan C">C Southan</name>
</author>
<author>
<name sortKey="Kjellberg, Mj" uniqKey="Kjellberg M">MJ Kjellberg</name>
</author>
<author>
<name sortKey="Kogej, T" uniqKey="Kogej T">T Kogej</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Southan, C" uniqKey="Southan C">C Southan</name>
</author>
<author>
<name sortKey="Boppana, K" uniqKey="Boppana K">K Boppana</name>
</author>
<author>
<name sortKey="Jagarlapudi, Sa" uniqKey="Jagarlapudi S">SA Jagarlapudi</name>
</author>
<author>
<name sortKey="Muresan, S" uniqKey="Muresan S">S Muresan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Tyrchan, C" uniqKey="Tyrchan C">C Tyrchan</name>
</author>
<author>
<name sortKey="Bostrom, J" uniqKey="Bostrom J">J Boström</name>
</author>
<author>
<name sortKey="Giordanetto, F" uniqKey="Giordanetto F">F Giordanetto</name>
</author>
<author>
<name sortKey="Winter, J" uniqKey="Winter J">J Winter</name>
</author>
<author>
<name sortKey="Muresan, S" uniqKey="Muresan S">S Muresan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kolarik, C" uniqKey="Kolarik C">C Kolarik</name>
</author>
<author>
<name sortKey="Hofmann Apitius, M" uniqKey="Hofmann Apitius M">M Hofmann-Apitius</name>
</author>
<author>
<name sortKey="Zimmermann, M" uniqKey="Zimmermann M">M Zimmermann</name>
</author>
<author>
<name sortKey="Fluck, J" uniqKey="Fluck J">J Fluck</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Klinger, R" uniqKey="Klinger R">R Klinger</name>
</author>
<author>
<name sortKey="Kolarik, C" uniqKey="Kolarik C">C Kolarik</name>
</author>
<author>
<name sortKey="Fluck, J" uniqKey="Fluck J">J Fluck</name>
</author>
<author>
<name sortKey="Hofmann Apitius, M" uniqKey="Hofmann Apitius M">M Hofmann-Apitius</name>
</author>
<author>
<name sortKey="Friedrich, Cm" uniqKey="Friedrich C">CM Friedrich</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Zimmermann, M" uniqKey="Zimmermann M">M Zimmermann</name>
</author>
<author>
<name sortKey="Fluck, J" uniqKey="Fluck J">J Fluck</name>
</author>
<author>
<name sortKey="Thi, Lt" uniqKey="Thi L">LT Thi</name>
</author>
<author>
<name sortKey="Kolarik, C" uniqKey="Kolarik C">C Kolarik</name>
</author>
<author>
<name sortKey="Kumpf, K" uniqKey="Kumpf K">K Kumpf</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Tseng, Y H" uniqKey="Tseng Y">Y-H Tseng</name>
</author>
<author>
<name sortKey="Lin, C J" uniqKey="Lin C">C-J Lin</name>
</author>
<author>
<name sortKey="Lin, Y I" uniqKey="Lin Y">Y-I Lin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Jessop, Dm" uniqKey="Jessop D">DM Jessop</name>
</author>
<author>
<name sortKey="Adams, Se" uniqKey="Adams S">SE Adams</name>
</author>
<author>
<name sortKey="Murray Rust, P" uniqKey="Murray Rust P">P Murray-Rust</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kiss, M" uniqKey="Kiss M">M Kiss</name>
</author>
<author>
<name sortKey="Nagy, A" uniqKey="Nagy A">Á Nagy</name>
</author>
<author>
<name sortKey="Vincze, V" uniqKey="Vincze V">V Vincze</name>
</author>
<author>
<name sortKey="Almasi, A" uniqKey="Almasi A">A Almási</name>
</author>
<author>
<name sortKey="Alexin, Z" uniqKey="Alexin Z">Z Alexin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Vazquez, M" uniqKey="Vazquez M">M Vazquez</name>
</author>
<author>
<name sortKey="Krallinger, M" uniqKey="Krallinger M">M Krallinger</name>
</author>
<author>
<name sortKey="Leitner, F" uniqKey="Leitner F">F Leitner</name>
</author>
<author>
<name sortKey="Valencia, A" uniqKey="Valencia A">A Valencia</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kim, Jd" uniqKey="Kim J">JD Kim</name>
</author>
<author>
<name sortKey="Ohta, T" uniqKey="Ohta T">T Ohta</name>
</author>
<author>
<name sortKey="Tateisi, Y" uniqKey="Tateisi Y">Y Tateisi</name>
</author>
<author>
<name sortKey="Tsujii, J" uniqKey="Tsujii J">J Tsujii</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kulick, S" uniqKey="Kulick S">S Kulick</name>
</author>
<author>
<name sortKey="Bies, A" uniqKey="Bies A">A Bies</name>
</author>
<author>
<name sortKey="Liberman, M" uniqKey="Liberman M">M Liberman</name>
</author>
<author>
<name sortKey="Mandel, M" uniqKey="Mandel M">M Mandel</name>
</author>
<author>
<name sortKey="Mcdonald, R" uniqKey="Mcdonald R">R McDonald</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Kolarik, C" uniqKey="Kolarik C">C Kolárik</name>
</author>
<author>
<name sortKey="Klinger, R" uniqKey="Klinger R">R Klinger</name>
</author>
<author>
<name sortKey="Friedrich, Cm" uniqKey="Friedrich C">CM Friedrich</name>
</author>
<author>
<name sortKey="Hofmann Apitius, M" uniqKey="Hofmann Apitius M">M Hofmann-Apitius</name>
</author>
<author>
<name sortKey="Fluck, J" uniqKey="Fluck J">J Fluck</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Krallinger, M" uniqKey="Krallinger M">M Krallinger</name>
</author>
<author>
<name sortKey="Leitner, F" uniqKey="Leitner F">F Leitner</name>
</author>
<author>
<name sortKey="Rabal, O" uniqKey="Rabal O">O Rabal</name>
</author>
<author>
<name sortKey="Vazquez, M" uniqKey="Vazquez M">M Vazquez</name>
</author>
<author>
<name sortKey="Oyarzabal, J" uniqKey="Oyarzabal J">J Oyarzabal</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Grego, T" uniqKey="Grego T">T Grego</name>
</author>
<author>
<name sortKey="P Zik, P" uniqKey="P Zik P">P Pęzik</name>
</author>
<author>
<name sortKey="Couto, Fm" uniqKey="Couto F">FM Couto</name>
</author>
<author>
<name sortKey="Rebholz Schuhmann, D" uniqKey="Rebholz Schuhmann D">D Rebholz-Schuhmann</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Degtyarenko, K" uniqKey="Degtyarenko K">K Degtyarenko</name>
</author>
<author>
<name sortKey="De Matos, P" uniqKey="De Matos P">P De Matos</name>
</author>
<author>
<name sortKey="Ennis, M" uniqKey="Ennis M">M Ennis</name>
</author>
<author>
<name sortKey="Hastings, J" uniqKey="Hastings J">J Hastings</name>
</author>
<author>
<name sortKey="Zbinden, M" uniqKey="Zbinden M">M Zbinden</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="De Matos, P" uniqKey="De Matos P">P De Matos</name>
</author>
<author>
<name sortKey="Alcantara, R" uniqKey="Alcantara R">R Alcántara</name>
</author>
<author>
<name sortKey="Dekker, A" uniqKey="Dekker A">A Dekker</name>
</author>
<author>
<name sortKey="Ennis, M" uniqKey="Ennis M">M Ennis</name>
</author>
<author>
<name sortKey="Hastings, J" uniqKey="Hastings J">J Hastings</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Sayle, R" uniqKey="Sayle R">R Sayle</name>
</author>
<author>
<name sortKey="Xie, Ph" uniqKey="Xie P">PH Xie</name>
</author>
<author>
<name sortKey="Muresan, S" uniqKey="Muresan S">S Muresan</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Akhondi, Sa" uniqKey="Akhondi S">SA Akhondi</name>
</author>
<author>
<name sortKey="Kors, Ja" uniqKey="Kors J">JA Kors</name>
</author>
<author>
<name sortKey="Muresan, S" uniqKey="Muresan S">S Muresan</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Weininger, D" uniqKey="Weininger D">D Weininger</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Heller, S" uniqKey="Heller S">S Heller</name>
</author>
<author>
<name sortKey="Mcnaught, A" uniqKey="Mcnaught A">A McNaught</name>
</author>
<author>
<name sortKey="Stein, S" uniqKey="Stein S">S Stein</name>
</author>
<author>
<name sortKey="Tchekhovskoi, D" uniqKey="Tchekhovskoi D">D Tchekhovskoi</name>
</author>
<author>
<name sortKey="Pletnev, I" uniqKey="Pletnev I">I Pletnev</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lowe, Dm" uniqKey="Lowe D">DM Lowe</name>
</author>
<author>
<name sortKey="Sayle, Ra" uniqKey="Sayle R">RA Sayle</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Stenetorp, P" uniqKey="Stenetorp P">P Stenetorp</name>
</author>
<author>
<name sortKey="Pyysalo, S" uniqKey="Pyysalo S">S Pyysalo</name>
</author>
<author>
<name sortKey="Topi, G" uniqKey="Topi G">G Topić</name>
</author>
<author>
<name sortKey="Ohta, T" uniqKey="Ohta T">T Ohta</name>
</author>
<author>
<name sortKey="Ananiadou, S" uniqKey="Ananiadou S">S Ananiadou</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lewin, I" uniqKey="Lewin I">I Lewin</name>
</author>
<author>
<name sortKey="Kafkas, S" uniqKey="Kafkas S">S Kafkas</name>
</author>
<author>
<name sortKey="Rebholz Schuhmann, D" uniqKey="Rebholz Schuhmann D">D Rebholz-Schuhmann</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<affiliations>
<list>
<country>
<li>Allemagne</li>
<li>Inde</li>
<li>Pays-Bas</li>
<li>Royaume-Uni</li>
<li>Suède</li>
</country>
<region>
<li>Angleterre</li>
<li>District de Cologne</li>
<li>Hollande-Méridionale</li>
<li>Rhénanie-du-Nord-Westphalie</li>
</region>
<settlement>
<li>Rotterdam</li>
<li>Sankt Augustin</li>
</settlement>
</list>
<tree>
<country name="Pays-Bas">
<region name="Hollande-Méridionale">
<name sortKey="Akhondi, Saber A" sort="Akhondi, Saber A" uniqKey="Akhondi S" first="Saber A." last="Akhondi">Saber A. Akhondi</name>
</region>
<name sortKey="Kors, Jan A" sort="Kors, Jan A" uniqKey="Kors J" first="Jan A." last="Kors">Jan A. Kors</name>
</country>
<country name="Allemagne">
<region name="Rhénanie-du-Nord-Westphalie">
<name sortKey="Klenner, Alexander G" sort="Klenner, Alexander G" uniqKey="Klenner A" first="Alexander G." last="Klenner">Alexander G. Klenner</name>
</region>
<name sortKey="Zimmermann, Marc" sort="Zimmermann, Marc" uniqKey="Zimmermann M" first="Marc" last="Zimmermann">Marc Zimmermann</name>
</country>
<country name="Suède">
<noRegion>
<name sortKey="Tyrchan, Christian" sort="Tyrchan, Christian" uniqKey="Tyrchan C" first="Christian" last="Tyrchan">Christian Tyrchan</name>
</noRegion>
<name sortKey="Muresan, Sorel" sort="Muresan, Sorel" uniqKey="Muresan S" first="Sorel" last="Muresan">Sorel Muresan</name>
</country>
<country name="Inde">
<noRegion>
<name sortKey="Manchala, Anil K" sort="Manchala, Anil K" uniqKey="Manchala A" first="Anil K." last="Manchala">Anil K. Manchala</name>
</noRegion>
<name sortKey="Boppana, Kiran" sort="Boppana, Kiran" uniqKey="Boppana K" first="Kiran" last="Boppana">Kiran Boppana</name>
<name sortKey="Jagarlapudi, Sarma A R P" sort="Jagarlapudi, Sarma A R P" uniqKey="Jagarlapudi S" first="Sarma A. R. P." last="Jagarlapudi">Sarma A. R. P. Jagarlapudi</name>
</country>
<country name="Royaume-Uni">
<region name="Angleterre">
<name sortKey="Lowe, Daniel" sort="Lowe, Daniel" uniqKey="Lowe D" first="Daniel" last="Lowe">Daniel Lowe</name>
</region>
<name sortKey="Sayle, Roger" sort="Sayle, Roger" uniqKey="Sayle R" first="Roger" last="Sayle">Roger Sayle</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000110 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000110 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     PMC:4182036
   |texte=   Annotated Chemical Patent Corpus: A Gold Standard for Text Mining
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:25268232" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a OcrV1 

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024